확장된 강화학습 시스템의 정형모델

윤지영; 김동욱; 신건윤; 김상수; 한명묵; Jiyoung Yun; Dong-Wook Kim; Gun-Yoon Shin; Sang-Soo Kim; Myung-Mook Han; 전도영; 송명호; 김수동; Do Yeong Jeon; Myeong Ho Song; Soo Dong Kim

연구문헌

국내 논문지

홈 > 연구문헌 > 국내 논문지 > 한국인터넷정보학회 논문지

한국인터넷정보학회 논문지

Current Result Document :

한글제목(Korean Title)	확장된 강화학습 시스템의 정형모델
영문제목(English Title)	Formal Model of Extended Reinforcement Learning (E-RL) System
저자(Author)	윤지영 김동욱 신건윤 김상수 한명묵 Jiyoung Yun Dong-Wook Kim Gun-Yoon Shin Sang-Soo Kim Myung-Mook Han 전도영 송명호 김수동 Do Yeong Jeon Myeong Ho Song Soo Dong Kim
원문수록처(Citation)	VOL 22 NO. 04 PP. 0013 ~ 0028 (2021. 08)
한글내용 (Korean Abstract)	강화학습은 한 환경에서 에이전트가 정책에 따라 액션을 취하고 보상 함수를 통해 액션을 평가 및 정책 최적화 과정을 반복하는 Closed-Loop 구조로 이루어진 알고리즘이다. 이러한 강화학습의 주요 장점은 액션의 품질을 평가하고 정책을 지속적으로 최적화 하는 것이다. 따라서, 강화학습은 지능형 시스템, 자율제어 시스템 개발에 효과적으로 활용될 수 있다. 기존의 강화학습은, 단일 정책, 단일 보상함수 및 비교적 단순한 정책 업데이트 기법을 제한적인 문제에 대해 제시하고 적용하였다. 본 논문에서는 구성요소의 복수성을 지원하는 확장된 강화학습 모델을 제안한다. 제안되는 확정 강화학습의 주요 구성 요소들을 정의하고, 그들의 컴퓨팅 모델을 포함하는 정형 모델을 제시한다. 또한, 이 정형모델을 기반으로 시스템 개발을 위한 설계 기법을 제시한다. 제안한 모델을 기반으로 자율 최적화 자동차 내비게이터 시스템에 적용 및 실험을 진행한다. 제시된 정형 모델과 설계 기법을 적용한 사례연구로, 복수의 자동차들이 최적 목적지에 단 시간에 도착할 수 있는 진화된 내비게이터 시스템 설계 및 구현을 진행한다.
영문내용 (English Abstract)	Reinforcement Learning (RL) is a machine learning algorithm that repeat the closed-loop process that agents perform actions specified by the policy, the action is evaluated with a reward function, and the policy gets updated accordingly. The key benefit of RL is the ability to optimze the policy with action evaluation. Hence, it can effectively be applied to developing advanced intelligent systems and autonomous systems. Conventional RL incoporates a single policy, a reward function, and relatively simple policy update, and hence its utilization was limited. In this paper, we propose an extended RL model that considers multiple instances of RL elements. We define a formal model of the key elements and their computing model of the extended RL. Then, we propose design methods for applying to system development. As a case stud of applying the proposed formal model and the design methods, we present the design and implementation of an advanced car navigator system that guides multiple cars to reaching their destinations efficiently.
키워드(Keyword)	내부전파경로 탐지 페이지랭크 알고리즘 설명가능한 인공지능 원격 데스트톱 프로토콜 특징 추출 Lateral Movement Pagerank Algorithm Explainable AI Remote Desktop Protocol Feature Extraction 강화학습 확장된 강화학습 모델 정형 모델 설계 기법 진화된 네비게이터 시스템 Reinforcement Learning (RL) Advanced RL Formal Model Design Methods Advanced Navigator System
파일첨부	PDF 다운로드